SEQUENCE LIST 



SEQ ID NO.: 1
Length: 417 amino acids
Type: amino acid
Topology: linear
Molecule type: protein
Sequence

Met Ala Leu Lys Val Leu Leu Glu Gln Glu Lys Thr Phe Phe Thr Leu
1 5 10 15

Leu Val Leu Leu Gly Tyr Leu Ser Cys Lys Val Thr Cys Glu Thr Gly
20 25 30

Asp Cys Arg Gln Gln Glu Phe Arg Asp Arg Ser Gly Asn Cys Val Pro
35 40 45

Cys Asn Gln Cys Gly Pro Gly Met Glu Leu Ser Lys Glu Cys Gly Phe
50 55 60

Gly Tyr Gly Glu Asp Ala Gln Cys Val Thr Cys Arg Leu His Arg Phe
65 70 75 80

Lys Glu Asp Trp Gly Phe Gln Lys Cys Lys Pro Cys Leu Asp Cys Ala
85 90 95

Val Val Asn Arg Phe Gln Lys Ala Asn Cys Ser Ala Thr Ser Asp Ala
100 105 110

Ile Cys Gly Asp Cys Leu Pro Gly Phe Tyr Arg Lys Thr Lys Leu Val
115 120 125

Gly Phe Gln Asp Met Glu Cys Val Pro Cys Gly Asp Pro Pro Pro Pro
130 135 140

Tyr Glu Pro His Cys Ala Ser Lys Val Asn Leu Val Lys Ile Ala Ser
145 150 155 160

Thr Ala Ser Ser Pro Arg Asp Thr Ala Leu Ala Ala Val Ile Cys Ser
165 170 175

Ala Leu Ala Thr Val Leu Leu Ala Leu Leu Ile Leu Cys Val Ile Tyr
180 185 190

Cys Lys Arg Gln Phe Met Glu Lys Lys Pro Ser Trp Ser Leu Arg Ser
195 200 205

Gln Asp Ile Gln Tyr Asn Gly Ser Glu Leu Ser Cys Leu Asp Pro Arg
210 215 220

Gln Leu His Glu Tyr Ala His Arg Ala Cys Cys Gln Cys Arg Arg Asp
225 230 235 240

Ser Val Gln Thr Cys Gly Pro Val Arg Leu Leu Pro Ser Met Cys Cys
245 250 255

Glu Glu Ala Cys Ser Pro Asn Pro Ala Thr Leu Gly Cys Gly Val His
260 265 270

Ser Ala Ala Ser Leu Gln Ala Arg Asn Ala Gly Pro Ala Gly Glu Met
275 280 285

Val Pro Thr Phe Phe Gly Ser Leu Thr Gln Ser Ile Cys Gly Glu Phe
290 295 300

Ser Asp Ala Trp Pro Leu Met Gln Asn Pro Met Gly Gly Asp Asn Ile
305 310 315 320

Ser Phe Cys Asp Ser Tyr Pro Glu Leu Thr Gly Glu Asp Ile His Ser
325 330 335

Leu Asn Pro Glu Leu Glu Ser Ser Thr Ser Leu Asp Ser Asn Ser Ser
340 345 350

Gln Asp Leu Val Gly Gly Ala Val Pro Val Gln Ser His Ser Glu Asn
355 360 365

Phe Thr Ala Ala Thr Asp Leu Ser Arg Tyr Asn Asn Thr Leu Val Glu
370 375 380

Ser Ala Ser Thr Gln Asp Ala Leu Thr Met Arg Ser Gln Leu Asp Gln
385 390 395 400

Glu Ser Gly Ala Ile Ile His Pro Ala Thr Gln Thr Ser Leu Gln Glu
405 410 415

Ala

SEQ ID NO.: 2
Length: 1269 base pairs
Type: nucleic acid
Strandness: single
Topology: linear
Molecule type: cDNA to mRNA
Sequence

ATGGCTTTAA AAGTGCTACT AGAACAAGAG AAAACGTTTT TCACTCTTTT AGTATTACTA 60
GGCTATTTGT CATGTAAAGT GACTTGTGAA ACAGGAGACT GTAGACAGCA AGAATTCAGG 120
GATCGGTCTG GAAACTGTGT TCCCTGCAAC CAGTGTGGGC CAGGCATGGA GTTGTCTAAG 180
GAATGTGGCT TCGGCTATGG GGAGGATGCA CAGTGTGTGA CGTGCCGGCT GCACAGGTTC 240
AAGGAGGACT GGGGCTTCCA GAAATGCAAG CCCTGTCTGG ACTGCGCAGT GGTGAACCGC 300
TTTCAGAAGG CAAATTGTTC AGCCACCAGT GATGCCATCT GCGGGGACTG CTTGCCAGGA 360
TTTTATAGGA AGACGAAACT TGTCGGCTTT CAAGACATGG AGTGTGTGCC TTGTGGAGAC 420
CCTCCTCCTC CTTACGAACC GCACTGTGCC AGCAAGGTCA ACCTCGTGAA GATCGCGTCC 480
ACGGCCTCCA GCCCACGGGA CACGGCGCTG GCTGCCGTTA TCTGCAGCGC TCTGGCCACC 540
GTCCTGCTGG CCCTGCTCAT CCTCTGTGTC ATCTATTGTA AGAGACAGTT TATGGAGAAG 600
AAACCCAGCT GGTCTCTGCG GTCACAGGAC ATTCAGTACA ACGGCTCTGA GCTGTCGTGT 660
CTTGACAGAC CTCAGCTCCA CGAATATGCC CACAGAGCCT GCTGCCAGTG CCGCCGTGAC 720
TCAGTGCAGA CCTGCGGGCC GGTGCGCTTG CTCCCATCCA TGTGCTGTGA GGAGGCCTGC 780
AGCCCCAACC CGGCGACTCT TGGTTGTGGG GTGCATTCTG CAGCCAGTCT TCAGGCAAGA 840
AACGCAGGCC CAGCCGGGGA GATGGTGCCG ACTTTCTTCG GATCCCTCAC GCAGTCCATC 900
TGTGGCGAGT TTTCAGATGC CTGGCCTCTG ATGCAGAATC CCATGGGTGG TGACAACATC 960
TCTTTTTGTG ACTCTTATCC TGAACTCACT GGAGAAGACA TTCATTCTCT CAATCCAGAA 1020
CTTGAAAGCT CAACGTCTTT GGATTCAAAT AGCAGTCAAG ATTTGGTTGG TGGGGCTGTT 1080
CCAGTCCAGT CTCATTCTGA AAACTTTACA GCAGCTACTG ATTTATCTAG ATATAACAAC 1140
ACACTGGTAG AATCAGCATC AACTCAGGAT GCACTAACTA TGAGAAGCCA GCTAGATCAG 1200
GAGAGTGGCG CTATCATCCA CCCAGCCACT CAGACGTCCC TCCAGGTAAG GCAGCGACTG 1260
GGTTCCCTG 1269


SEQ ID NO.: 3
Length: 1704 base pairs
Type: nucleic acid
Strandness: single
Topology: linear
Molecule type: cDNA to mRNA
Sequence

GGGAACGTAG AACTCTCCAA CAATAAATAC ATTTGATAAG AAAGATGGCT TTAAAAGTGC 60
TACTAGAACA AGAGAAAACG TTTTTCACTC TTTTAGTATT ACTAGGCTAT TTGTCATGTA 120
AAGTGACTTG TGAAACAGGA GACTGTAGAC AGCAAGAATT CAGGGATCGG TCTGGAAACT 180
GTGTTCCCTG CAACCAGTGT GGGCCAGGCA TGGAGTTGTC TAAGGAATGT GGCTTCGGCT 240
ATGGGGAGGA TGCACAGTGT GTGACGTGCC GGCTGCACAG GTTCAAGGAG GACTGGGGCT 300
TCCAGAAATG CAAGCCCTGT CTGGACTGCG CAGTGGTGAA CCGCTTTCAG AAGGCAAATT 360
GTTCAGCCAC CAGTGATGCC ATCTGCGGGG ACTGCTTGCC AGGATTTTAT AGGAAGACGA 420
AACTTGTCGG CTTTCAAGAC ATGGAGTGTG TGCCTTGTGG AGACCCTCCT CCTCCTTACG 480
AACCGCACTG TGCCAGCAAG GTCAACCTCG TGAAGATCGC GTCCACGGCC TCCAGCCCAC 540
GGGACACGGC GCTGGCTGCC GTTATCTGCA GCGCTCTGGC CACCGTCCTG CTGGCCCTGC 600
TCATCCTCTG TGTCATCTAT TGTAAGAGAC AGTTTATGGA GAAGAAACCC AGCTGGTCTC 660
TGCGGTCACA GGACATTCAG TACAACGGCT CTGAGCTGTC GTGTCTTGAC AGACCTCAGC 720
TCCACGAATA TGCCCACAGA GCCTGCTGCC AGTGCCGCCG TGACTCAGTG CAGACCTGCG 780
GGCCGGTGCG CTTGCTCCCA TCCATGTGCT GTGAGGAGGC CTGCAGCCCC AACCCGGCGA 840
CTCTTGGTTG TGGGGTGCAT TCTGCAGCCA GTCTTCAGGC AAGAAACGCA GGCCCAGCCG 900
GGGAGATGGT GCCGACTTTC TTCGGATCCC TCACGCAGTC CATCTGTGGC GAGTTTTCAG 960
ATGCCTGGCC TCTGATGCAG AATCCCATGG GTGGTGACAA CATCTCTTTT TGTGACTCTT 1020
ATCCTGAACT CACTGGAGAA GACATTCATT CTCTCAATCC AGAACTTGAA AGCTCAACGT 1080
CTTTGGATTC AAATAGCAGT CAAGATTTGG TTGGTGGGGC TGTTCCAGTC CAGTCTCATT 1140
CTGAAAACTT TACAGCAGCT ACTGATTTAT CTAGATATAA CAACACACTG GTAGAATCAG 1200
CATCAACTCA GGATGCACTA ACTATGAGAA GCCAGCTAGA TCAGGAGAGT GGCGCTATCA 1260
TCCACCCAGC CACTCAGACG TCCCTCCAGG AAGCTTAAAG AACCTGCTTC TTTCTGCAGT 1320
AGAAGCGTGT GCTGGAACCC AAAGAGTACT CCTTTGTTAG GCTTATGGAC TGAGCAGTCT 1380
GGACCTTGCA TGGCTTCTGG GGCAAAAATA AATCTGAACC AAACTGACGG CATTTGAAGC 1440
CTTTCAGCCA GTTGCTTCTG AGCCAGACCA GCTGTAAGCT GAAACCTCAA TGAATAACAA 1500
GAAAAGACTC CAGGCCGACT CATGATACTC TGCATCTTTC CTACATGAGA AGCTTCTCTG 1560
CCACAAAAGT GACTTCAAAG ACGGATGGGT TGAGCTGGCA GCCTATGAGA TTGTGGACAT 1620
ATAACAAGAA ACAGAAATGC CCTCATGCTT ATTTTCATGG TGATTGTGGT TTTACAAGAC 1680
TGAAGACCCA GAGTATACTT TTTC 1704



SEQ ID NO.: 4
Length: 1704 base pairs
Type: nucleic acid
Strandness: single
Topology: linear
Molecule type: cDNA to mRNA
Original source:
Organism: Homo sapiens
Cell line: HAS303
Feature
Name/Key: CDS
Location: 45..1295
Identification method:
Name/Key: sig peptide
Location: 45..119
Identification method:
Name/Key: mat peptide
Location: 120..1295
Identification method:
Sequence

GGGAACGTAG AACTCTCCAA CAATAAATAC ATTTGATAAG AAAG ATG GCT TTA AAA 56
Met Ala Leu Lys
-25

GTG CTA CTA GAA CAA GAG AAA ACG TTT TTC ACT CTT TTA GTA TTA CTA 104
Val Leu Leu Glu Gln Glu Lys Thr Phe Phe Thr Leu Leu Val Leu Leu
-20 -15 -10

GGC TAT TTG TCA TGT AAA GTG ACT TGT GAA ACA GGA GAC TGT AGA CAG 152
Gly Tyr Leu Ser Cys Lys Val Thr Cys Glu Thr Gly Asp Cys Arg Gln
-5 1 5 10

CAA GAA TTC AGG GAT CGG TCT GGA AAC TGT GTT CCC TGC AAC CAG TGT 200
Gln Glu Phe Arg Asp Arg Ser Gly Asn Cys Val Pro Cys Asn Gln Cys
15 20 25

GGG CCA GGC ATG GAG TTG TCT AAG GAA TGT GGC TTC GGC TAT GGG GAG 248
Gly Pro Gly Met Glu Leu Ser Lys Glu Cys Gly Phe Gly Tyr Gly Glu
30 35 40

GAT GCA CAG TGT GTG ACG TGC CGG CTG CAC AGG TTC AAG GAG GAC TGG 296
Asp Ala Gln Cys Val Thr Cys Arg Leu His Arg Phe Lys Glu Asp Trp
45 50 55

GGC TTC CAG AAA TGC AAG CCC TGT CTG GAC TGC GCA GTG GTG AAC CGC 344
Gly Phe Gln Lys Cys Lys Pro Cys Leu Asp Cys Ala Val Val Asn Arg
60 65 70 75

TTT CAG AAG GCA AAT TGT TCA GCC ACC AGT GAT GCC ATC TGC GGG GAC 392
Phe Gln Lys Ala Asn Cys Ser Ala Thr Ser Asp Ala Ile Cys Gly Asp
80 85 90

TGC TTG CCA GGA TTT TAT AGG AAG ACG AAA CTT GTC GGC TTT CAA GAC 440
Cys Leu Pro Gly Phe Tyr Arg Lys Thr Lys Leu Val Gly Phe Gln Asp
95 100 105

ATG GAG TGT GTG CCT TGT GGA GAC CCT CCT CCT CCT TAC GAA CCG CAC 488
Met Glu Cys Val Pro Cys Gly Asp Pro Pro Pro Pro Tyr Glu Pro His
110 115 120

TGT GCC AGC AAG GTC AAC CTC GTG AAG ATC GCG TCC ACG GCC TCC AGC 536
Cys Ala Ser Lys Val Asn Leu Val Lys Ile Ala Ser Thr Ala Ser Ser
125 130 135

CCA CGG GAC ACG GCG CTG GCT GCC GTT ATC TGC AGC GCT CTG GCC ACC 584
Pro Arg Asp Thr Ala Leu Ala Ala Val Ile Cys Ser Ala Leu Ala Thr
140 145 150 155

GTC CTG CTG GCC CTG CTC ATC CTC TGT GTC ATC TAT TGT AAG AGA CAG 632
Val Leu Leu Ala Leu Leu Ile Leu Cys Val Ile Tyr Cys Lys Arg Gln
160 165 170

TTT ATG GAG AAG AAA CCC AGC TGG TCT CTG CGG TCA CAG GAC ATT CAG 680
Phe Met Glu Lys Lys Pro Ser Trp Ser Leu Arg Ser Gln Asp Ile Gln
175 180 185

TAC AAC GGC TCT GAG CTG TCG TGT CTT GAC AGA CCT CAG CTC CAC GAA 728
Tyr Asn Gly Ser Glu Leu Ser Cys Leu Asp Pro Arg Gln Leu His Glu
190 195 200

TAT GCC CAC AGA GCC TGC TGC CAG TGC CGC CGT GAC TCA GTG CAG ACC 776
Tyr Ala His Arg Ala Cys Cys Gln Cys Arg Arg Asp Ser Val Gln Thr
205 210 215

TGC GGG CCG GTG CGC TTG CTC CCA TCC ATG TGC TGT GAG GAG GCC TGC 824
Cys Gly Pro Val Arg Leu Leu Pro Ser Met Cys Cys Glu Glu Ala Cys
220 225 230 235

AGC CCC AAC CCG GCG ACT CTT GGT TGT GGG GTG CAT TCT GCA GCC AGT 872
Ser Pro Asn Pro Ala Thr Leu Gly Cys Gly Val His Ser Ala Ala Ser
240 245 250

CTT CAG GCA AGA AAC GCA GGC CCA GCC GGG GAG ATG GTG CCG ACT TTC 920
Leu Gln Ala Arg Asn Ala Gly Pro Ala Gly Glu Met Val Pro Thr Phe
255 260 265

TTC GGA TCC CTC ACG CAG TCC ATC TGT GGC GAG TTT TCA GAT GCC TGG 968
Phe Gly Ser Leu Thr Gln Ser Ile Cys Gly Glu Phe Ser Asp Ala Trp
270 275 280

CCT CTG ATG CAG AAT CCC ATG GGT GGT GAC AAC ATC TCT TTT TGT GAC 1016
Pro Leu Met Gln Asn Pro Met Gly Gly Asp Asn Ile Ser Phe Cys Asp
285 290 295

TCT TAT CCT GAA CTC ACT GGA GAA GAC ATT CAT TCT CTC AAT CCA GAA 1064
Ser Tyr Pro Glu Leu Thr Gly Glu Asp Ile His Ser Leu Asn Pro Glu
300 305 310 315

CTT GAA AGC TCA ACG TCT TTG GAT TCA AAT AGC AGT CAA GAT TTG GTT 1112
Leu Glu Ser Ser Thr Ser Leu Asp Ser Asn Ser Ser Gln Asp Leu Val
320 325 330

GGT GGG GCT GTT CCA GTC CAG TCT CAT TCT GAA AAC TTT ACA GCA GCT 1160
Gly Gly Ala Val Pro Val Gln Ser His Ser Glu Asn Phe Thr Ala Ala
335 340 345

ACT GAT TTA TCT AGA TAT AAC AAC ACA CTG GTA GAA TCA GCA TCA ACT 1208
Thr Asp Leu Ser Arg Tyr Asn Asn Thr Leu Val Glu Ser Ala Ser Thr
350 355 360

CAG GAT GCA CTA ACT ATG AGA AGC CAG CTA GAT CAG GAG AGT GGC GCT 1256
Gln Asp Ala Leu Thr Met Arg Ser Gln Leu Asp Gln Glu Ser Gly Ala
365 370 375

ATC ATC CAC CCA GCC ACT CAG ACG TCC CTC CAG GAA GCT TAAAGAACCT 1305
Ile Ile His Pro Ala Thr Gln Thr Ser Leu Gln Glu Ala
380 385 390

GCTTCTTTCT GCAGTAGAAG CGTGTGCTGG AACCCAAAGA GTACTCCTTT GTTAGGCTTA 1365
TGGACTGAGC AGTCTGGACC TTGCATGGCT TCTGGGGCAA AAATAAATCT GAACCAAACT 1425
GACGGCATTT GAAGCCTTTC AGCCAGTTGC TTCTGAGCCA GACCAGCTGT AAGCTGAAAC 1485
CTCAATGAAT AACAAGAAAA GACTCCAGGC CGACTCATGA TACTCTGCAT CTTTCCTACA 1545
TGAGAAGCTT CTCTGCCACA AAAGTGACTT CAAAGACGGA TGGGTTGAGC TGGCAGCCTA 1605
TGAGATTGTG GACATATAAC AAGAAACAGA AATGCCCTCA TGCTTATTTT CATGGTGATT 1665
GTGGTTTTAC AAGACTGAAG ACCCAGAGTA TACTTTTTC 1704
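
The feature locations given for SEQ ID NO: 4 can be cross-checked against the stated protein lengths with simple arithmetic. The following is a minimal sketch, not part of the listing itself; it assumes the locations are 1-based and inclusive, and the names `FEATURES`, `span_length` and `codon_count` are hypothetical helpers introduced only for illustration.

```python
# Feature locations as printed for SEQ ID NO: 4.
# Assumption: coordinates are 1-based and inclusive.
FEATURES = {
    "CDS": (45, 1295),
    "sig peptide": (45, 119),
    "mat peptide": (120, 1295),
}


def span_length(start: int, end: int) -> int:
    """Number of nucleotides in a 1-based inclusive interval."""
    return end - start + 1


def codon_count(start: int, end: int) -> int:
    """Number of whole codons in the interval; rejects partial codons."""
    length = span_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"{start}..{end} is not a multiple of 3 ({length} nt)")
    return length // 3


# Map each feature to (nucleotides, codons).
summary = {
    name: (span_length(*loc), codon_count(*loc))
    for name, loc in FEATURES.items()
}

for name, (nt, codons) in summary.items():
    print(f"{name}: {nt} nt, {codons} codons")
```

Under these assumptions the CDS covers 417 codons, which agrees with the 417 residues of SEQ ID NO: 1, and the signal and mature peptides account for 25 and 392 residues, matching the -25 to 392 residue numbering used in the alignment.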


SEQ ID NO.: 5
Length: 423 amino acids
Type: amino acid
Topology: linear
Molecule type: protein
Sequence

Met Ala Leu Lys Val Leu Leu Glu Gln Glu Lys Thr Phe Phe Thr Leu
1 5 10 15

Leu Val Leu Leu Gly Tyr Leu Ser Cys Lys Val Thr Cys Glu Thr Gly
20 25 30

Asp Cys Arg Gln Gln Glu Phe Arg Asp Arg Ser Gly Asn Cys Val Pro
35 40 45

Cys Asn Gln Cys Gly Pro Gly Met Glu Leu Ser Lys Glu Cys Gly Phe
50 55 60

Gly Tyr Gly Glu Asp Ala Gln Cys Val Thr Cys Arg Leu His Arg Phe
65 70 75 80

Lys Glu Asp Trp Gly Phe Gln Lys Cys Lys Pro Cys Leu Asp Cys Ala
85 90 95

Val Val Asn Arg Phe Gln Lys Ala Asn Cys Ser Ala Thr Ser Asp Ala
100 105 110

Ile Cys Gly Asp Cys Leu Pro Gly Phe Tyr Arg Lys Thr Lys Leu Val
115 120 125

Gly Phe Gln Asp Met Glu Cys Val Pro Cys Gly Asp Pro Pro Pro Pro
130 135 140

Tyr Glu Pro His Cys Ala Ser Lys Val Asn Leu Val Lys Ile Ala Ser
145 150 155 160

Thr Ala Ser Ser Pro Arg Asp Thr Ala Leu Ala Ala Val Ile Cys Ser
165 170 175

Ala Leu Ala Thr Val Leu Leu Ala Leu Leu Ile Leu Cys Val Ile Tyr
180 185 190

Cys Lys Arg Gln Phe Met Glu Lys Lys Pro Ser Trp Ser Leu Arg Ser
195 200 205

Gln Asp Ile Gln Tyr Asn Gly Ser Glu Leu Ser Cys Leu Asp Pro Arg
210 215 220

Gln Leu His Glu Tyr Ala His Arg Ala Cys Cys Gln Cys Arg Arg Asp
225 230 235 240

Ser Val Gln Thr Cys Gly Pro Val Arg Leu Leu Pro Ser Met Cys Cys
245 250 255

Glu Glu Ala Cys Ser Pro Asn Pro Ala Thr Leu Gly Cys Gly Val His
260 265 270

Ser Ala Ala Ser Leu Gln Ala Arg Asn Ala Gly Pro Ala Gly Glu Met
275 280 285

Val Pro Thr Phe Phe Gly Ser Leu Thr Gln Ser Ile Cys Gly Glu Phe
290 295 300

Ser Asp Ala Trp Pro Leu Met Gln Asn Pro Met Gly Gly Asp Asn Ile
305 310 315 320

Ser Phe Cys Asp Ser Tyr Pro Glu Leu Thr Gly Glu Asp Ile His Ser
325 330 335

Leu Asn Pro Glu Leu Glu Ser Ser Thr Ser Leu Asp Ser Asn Ser Ser
340 345 350

Gln Asp Leu Val Gly Gly Ala Val Pro Val Gln Ser His Ser Glu Asn
355 360 365

Phe Thr Ala Ala Thr Asp Leu Ser Arg Tyr Asn Asn Thr Leu Val Glu
370 375 380

Ser Ala Ser Thr Gln Asp Ala Leu Thr Met Arg Ser Gln Leu Asp Gln
385 390 395 400

Glu Ser Gly Ala Ile Ile His Pro Ala Thr Gln Thr Ser Leu Gln Val
405 410 415

Arg Gln Arg Leu Gly Ser Leu
420
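
For comparison with databases that use one-letter codes, the three-letter residues of the protein entries can be converted mechanically. The sketch below is illustrative only; `THREE_TO_ONE` and `to_one_letter` are hypothetical names, and the mapping covers just the twenty standard amino acids, so any misspelled code in the input raises a `KeyError` rather than being silently guessed.

```python
# Standard three-letter to one-letter amino acid codes (20 standard residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues: str) -> str:
    """Convert whitespace-separated three-letter codes to a one-letter string."""
    return "".join(THREE_TO_ONE[code] for code in residues.split())


# First row of SEQ ID NO: 5 (residues 1 to 16) and its last row (417 to 423).
first_row = "Met Ala Leu Lys Val Leu Leu Glu Gln Glu Lys Thr Phe Phe Thr Leu"
last_row = "Arg Gln Arg Leu Gly Ser Leu"

print(to_one_letter(first_row))
print(to_one_letter(last_row))
```

Position-number rows must be stripped before conversion, since the function expects residue codes only.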

SEQ ID NO.: 6
Length: 1269 base pairs
Type: nucleic acid
Strandness: single
Topology: linear
Molecule type: cDNA to mRNA
Sequence

ATGGCTTTAA AAGTGCTACT AGAACAAGAG AAAACGTTTT TCACTCTTTT AGTATTACTA 60
GGCTATTTGT CATGTAAAGT GACTTGTGAA ACAGGAGACT GTAGACAGCA AGAATTCAGG 120
GATCGGTCTG GAAACTGTGT TCCCTGCAAC CAGTGTGGGC CAGGCATGGA GTTGTCTAAG 180
GAATGTGGCT TCGGCTATGG GGAGGATGCA CAGTGTGTGA CGTGCCGGCT GCACAGGTTC 240
AAGGAGGACT GGGGCTTCCA GAAATGCAAG CCCTGTCTGG ACTGCGCAGT GGTGAACCGC 300
TTTCAGAAGG CAAATTGTTC AGCCACCAGT GATGCCATCT GCGGGGACTG CTTGCCAGGA 360
TTTTATAGGA AGACGAAACT TGTCGGCTTT CAAGACATGG AGTGTGTGCC TTGTGGAGAC 420
CCTCCTCCTC CTTACGAACC GCACTGTGCC AGCAAGGTCA ACCTCGTGAA GATCGCGTCC 480
ACGGCCTCCA GCCCACGGGA CACGGCGCTG GCTGCCGTTA TCTGCAGCGC TCTGGCCACC 540
GTCCTGCTGG CCCTGCTCAT CCTCTGTGTC ATCTATTGTA AGAGACAGTT TATGGAGAAG 600
AAACCCAGCT GGTCTCTGCG GTCACAGGAC ATTCAGTACA ACGGCTCTGA GCTGTCGTGT 660
CTTGACAGAC CTCAGCTCCA CGAATATGCC CACAGAGCCT GCTGCCAGTG CCGCCGTGAC 720
TCAGTGCAGA CCTGCGGGCC GGTGCGCTTG CTCCCATCCA TGTGCTGTGA GGAGGCCTGC 780
AGCCCCAACC CGGCGACTCT TGGTTGTGGG GTGCATTCTG CAGCCAGTCT TCAGGCAAGA 840
AACGCAGGCC CAGCCGGGGA GATGGTGCCG ACTTTCTTCG GATCCCTCAC GCAGTCCATC 900
TGTGGCGAGT TTTCAGATGC CTGGCCTCTG ATGCAGAATC CCATGGGTGG TGACAACATC 960
TCTTTTTGTG ACTCTTATCC TGAACTCACT GGAGAAGACA TTCATTCTCT CAATCCAGAA 1020
CTTGAAAGCT CAACGTCTTT GGATTCAAAT AGCAGTCAAG ATTTGGTTGG TGGGGCTGTT 1080
CCAGTCCAGT CTCATTCTGA AAACTTTACA GCAGCTACTG ATTTATCTAG ATATAACAAC 1140
ACACTGGTAG AATCAGCATC AACTCAGGAT GCACTAACTA TGAGAAGCCA GCTAGATCAG 1200
GAGAGTGGCG CTATCATCCA CCCAGCCACT CAGACGTCCC TCCAGGTAAG GCAGCGACTG 1260
GGTTCCCTG 1269



SEQ ID NO.: 7
Length: 1496 base pairs
Type: nucleic acid
Strandness: single
Topology: linear
Molecule type: cDNA to mRNA
Sequence

GGGAACGTAG AACTCTCCAA CAATAAATAC ATTTGATAAG AAAGATGGCT TTAAAAGTGC 60
TACTAGAACA AGAGAAAACG TTTTTCACTC TTTTAGTATT ACTAGGCTAT TTGTCATGTA 120
AAGTGACTTG TGAAACAGGA GACTGTAGAC AGCAAGAATT CAGGGATCGG TCTGGAAACT 180
GTGTTCCCTG CAACCAGTGT GGGCCAGGCA TGGAGTTGTC TAAGGAATGT GGCTTCGGCT 240
ATGGGGAGGA TGCACAGTGT GTGACGTGCC GGCTGCACAG GTTCAAGGAG GACTGGGGCT 300
TCCAGAAATG CAAGCCCTGT CTGGACTGCG CAGTGGTGAA CCGCTTTCAG AAGGCAAATT 360
GTTCAGCCAC CAGTGATGCC ATCTGCGGGG ACTGCTTGCC AGGATTTTAT AGGAAGACGA 420
AACTTGTCGG CTTTCAAGAC ATGGAGTGTG TGCCTTGTGG AGACCCTCCT CCTCCTTACG 480
AACCGCACTG TGCCAGCAAG GTCAACCTCG TGAAGATCGC GTCCACGGCC TCCAGCCCAC 540
GGGACACGGC GCTGGCTGCC GTTATCTGCA GCGCTCTGGC CACCGTCCTG CTGGCCCTGC 600
TCATCCTCTG TGTCATCTAT TGTAAGAGAC AGTTTATGGA GAAGAAACCC AGCTGGTCTC 660
TGCGGTCACA GGACATTCAG TACAACGGCT CTGAGCTGTC GTGTCTTGAC AGACCTCAGC 720
TCCACGAATA TGCCCACAGA GCCTGCTGCC AGTGCCGCCG TGACTCAGTG CAGACCTGCG 780
GGCCGGTGCG CTTGCTCCCA TCCATGTGCT GTGAGGAGGC CTGCAGCCCC AACCCGGCGA 840
CTCTTGGTTG TGGGGTGCAT TCTGCAGCCA GTCTTCAGGC AAGAAACGCA GGCCCAGCCG 900
GGGAGATGGT GCCGACTTTC TTCGGATCCC TCACGCAGTC CATCTGTGGC GAGTTTTCAG 960
ATGCCTGGCC TCTGATGCAG AATCCCATGG GTGGTGACAA CATCTCTTTT TGTGACTCTT 1020
ATCCTGAACT CACTGGAGAA GACATTCATT CTCTCAATCC AGAACTTGAA AGCTCAACGT 1080
CTTTGGATTC AAATAGCAGT CAAGATTTGG TTGGTGGGGC TGTTCCAGTC CAGTCTCATT 1140
CTGAAAACTT TACAGCAGCT ACTGATTTAT CTAGATATAA CAACACACTG GTAGAATCAG 1200
CATCAACTCA GGATGCACTA ACTATGAGAA GCCAGCTAGA TCAGGAGAGT GGCGCTATCA 1260
TCCACCCAGC CACTCAGACG TCCCTCCAGG TAAGGCAGCG ACTGGGTTCC CTGTGAACAC 1320
AGCACTGACT TACAGTAGAT CAGAACTCTG TTCCCAGCAT AAGATTTGGG GGAACCTGAT 1380
GAGTTTTTTT TTTGCATCTT TAATAATTTC TTGTATGTTG TAGAGTATGT TTTAAAATAA 1440
ATTTCAAGTA TTTTTTTTAA AAACTAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 1496



SEQ ID NO.: 8
Length: 1496 base pairs
Type: nucleic acid
Strandness: single
Topology: linear
Molecule type: cDNA to mRNA
Original source:
Organism: Homo sapiens
Cell line: HAS303
Feature
Name/Key: CDS
Location: 45..1313
Identification method:
Name/Key: sig peptide
Location: 45..119
Identification method: S
Name/Key: mat peptide
Location: 120..1313
Identification method: S
Sequence

GGGAACGTAG AACTCTCCAA CAATAAATAC ATTTGATAAG AAAG ATG GCT TTA AAA 56
Met Ala Leu Lys
-25

GTG CTA CTA GAA CAA GAG AAA ACG TTT TTC ACT CTT TTA GTA TTA CTA 104
Val Leu Leu Glu Gln Glu Lys Thr Phe Phe Thr Leu Leu Val Leu Leu
-20 -15 -10

GGC TAT TTG TCA TGT AAA GTG ACT TGT GAA ACA GGA GAC TGT AGA CAG 152
Gly Tyr Leu Ser Cys Lys Val Thr Cys Glu Thr Gly Asp Cys Arg Gln
-5 1 5 10

CAA GAA TTC AGG GAT CGG TCT GGA AAC TGT GTT CCC TGC AAC CAG TGT 200
Gln Glu Phe Arg Asp Arg Ser Gly Asn Cys Val Pro Cys Asn Gln Cys
15 20 25

GGG CCA GGC ATG GAG TTG TCT AAG GAA TGT GGC TTC GGC TAT GGG GAG 248
Gly Pro Gly Met Glu Leu Ser Lys Glu Cys Gly Phe Gly Tyr Gly Glu
30 35 40

GAT GCA CAG TGT GTG ACG TGC CGG CTG CAC AGG TTC AAG GAG GAC TGG 296
Asp Ala Gln Cys Val Thr Cys Arg Leu His Arg Phe Lys Glu Asp Trp
45 50 55

GGC TTC CAG AAA TGC AAG CCC TGT CTG GAC TGC GCA GTG GTG AAC CGC 344
Gly Phe Gln Lys Cys Lys Pro Cys Leu Asp Cys Ala Val Val Asn Arg
60 65 70 75

TTT CAG AAG GCA AAT TGT TCA GCC ACC AGT GAT GCC ATC TGC GGG GAC 392
Phe Gln Lys Ala Asn Cys Ser Ala Thr Ser Asp Ala Ile Cys Gly Asp
80 85 90

TGC TTG CCA GGA TTT TAT AGG AAG ACG AAA CTT GTC GGC TTT CAA GAC 440
Cys Leu Pro Gly Phe Tyr Arg Lys Thr Lys Leu Val Gly Phe Gln Asp
95 100 105

ATG GAG TGT GTG CCT TGT GGA GAC CCT CCT CCT CCT TAC GAA CCG CAC 488
Met Glu Cys Val Pro Cys Gly Asp Pro Pro Pro Pro Tyr Glu Pro His
110 115 120

TGT GCC AGC AAG GTC AAC CTC GTG AAG ATC GCG TCC ACG GCC TCC AGC 536
Cys Ala Ser Lys Val Asn Leu Val Lys Ile Ala Ser Thr Ala Ser Ser
125 130 135

CCA CGG GAC ACG GCG CTG GCT GCC GTT ATC TGC AGC GCT CTG GCC ACC 584
Pro Arg Asp Thr Ala Leu Ala Ala Val Ile Cys Ser Ala Leu Ala Thr
140 145 150 155

GTC CTG CTG GCC CTG CTC ATC CTC TGT GTC ATC TAT TGT AAG AGA CAG 632
Val Leu Leu Ala Leu Leu Ile Leu Cys Val Ile Tyr Cys Lys Arg Gln
160 165 170

TTT ATG GAG AAG AAA CCC AGC TGG TCT CTG CGG TCA CAG GAC ATT CAG 680
Phe Met Glu Lys Lys Pro Ser Trp Ser Leu Arg Ser Gln Asp Ile Gln
175 180 185

TAC AAC GGC TCT GAG CTG TCG TGT CTT GAC AGA CCT CAG CTC CAC GAA 728
Tyr Asn Gly Ser Glu Leu Ser Cys Leu Asp Pro Arg Gln Leu His Glu
190 195 200

TAT GCC CAC AGA GCC TGC TGC CAG TGC CGC CGT GAC TCA GTG CAG ACC 776
Tyr Ala His Arg Ala Cys Cys Gln Cys Arg Arg Asp Ser Val Gln Thr
205 210 215

TGC GGG CCG GTG CGC TTG CTC CCA TCC ATG TGC TGT GAG GAG GCC TGC 824
Cys Gly Pro Val Arg Leu Leu Pro Ser Met Cys Cys Glu Glu Ala Cys
220 225 230 235

AGC CCC AAC CCG GCG ACT CTT GGT TGT GGG GTG CAT TCT GCA GCC AGT 872
Ser Pro Asn Pro Ala Thr Leu Gly Cys Gly Val His Ser Ala Ala Ser
240 245 250

CTT CAG GCA AGA AAC GCA GGC CCA GCC GGG GAG ATG GTG CCG ACT TTC 920
Leu Gln Ala Arg Asn Ala Gly Pro Ala Gly Glu Met Val Pro Thr Phe
255 260 265

TTC GGA TCC CTC ACG CAG TCC ATC TGT GGC GAG TTT TCA GAT GCC TGG 968
Phe Gly Ser Leu Thr Gln Ser Ile Cys Gly Glu Phe Ser Asp Ala Trp
270 275 280

CCT CTG ATG CAG AAT CCC ATG GGT GGT GAC AAC ATC TCT TTT TGT GAC 1016
Pro Leu Met Gln Asn Pro Met Gly Gly Asp Asn Ile Ser Phe Cys Asp
285 290 295

TCT TAT CCT GAA CTC ACT GGA GAA GAC ATT CAT TCT CTC AAT CCA GAA 1064
Ser Tyr Pro Glu Leu Thr Gly Glu Asp Ile His Ser Leu Asn Pro Glu
300 305 310 315

CTT GAA AGC TCA ACG TCT TTG GAT TCA AAT AGC AGT CAA GAT TTG GTT 1112
Leu Glu Ser Ser Thr Ser Leu Asp Ser Asn Ser Ser Gln Asp Leu Val
320 325 330

GGT GGG GCT GTT CCA GTC CAG TCT CAT TCT GAA AAC TTT ACA GCA GCT 1160
Gly Gly Ala Val Pro Val Gln Ser His Ser Glu Asn Phe Thr Ala Ala
335 340 345

ACT GAT TTA TCT AGA TAT AAC AAC ACA CTG GTA GAA TCA GCA TCA ACT 1208
Thr Asp Leu Ser Arg Tyr Asn Asn Thr Leu Val Glu Ser Ala Ser Thr
350 355 360

CAG GAT GCA CTA ACT ATG AGA AGC CAG CTA GAT CAG GAG AGT GGC GCT 1256
Gln Asp Ala Leu Thr Met Arg Ser Gln Leu Asp Gln Glu Ser Gly Ala
365 370 375

ATC ATC CAC CCA GCC ACT CAG ACG TCC CTC CAG GTA AGG CAG CGA CTG 1304
Ile Ile His Pro Ala Thr Gln Thr Ser Leu Gln Val Arg Gln Arg Leu
380 385 390 395

GGT TCC CTG TGAACACAG CACTGACTTA CAGTAGATCA GAACTCTGTT CCCAGCATAA 1362
Gly Ser Leu

GATTTGGGGG AACCTGATGA GTTTTTTTTT TGCATCTTTA ATAATTTCTT GTATGTTGTA 1422
GAGTATGTTT TAAAATAAAT TTCAAGTATT TTTTTTAAAA ACTAAAAAAA AAAAAAAAAA 1482
AAAAAAAAAA AAAA 1496
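
The codon-to-residue alignment in SEQ ID NO: 8 can be spot-checked by translating a stretch of the nucleotide sequence. The following is a minimal sketch that assumes the standard genetic code; `CODON_TABLE` and `translate` are hypothetical names, and the example input is the final nine codons of the CDS region as printed above (eight sense codons followed by the TGA stop).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (spaces ignored) to one-letter residues, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# Last codons of the SEQ ID NO: 8 coding region, followed by the stop codon.
tail = "GTA AGG CAG CGA CTG GGT TCC CTG TGA"
print(translate(tail))  # VRQRLGSL
```

The result corresponds to the printed C-terminal residues Val Arg Gln Arg Leu Gly Ser Leu. Running the same function over a whole coding region gives a quick way to compare the nucleotide rows against the residue rows.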






SEQ ID NO.: 9
Length: 35 base pairs
Type: nucleic acid
Strandness: single
Topology: linear
Sequence

CGATTGAATT CTAGACCTGC CTCGAGNNNN NNNNN

SEQ ID NO.: 10
Length: 28 base pairs
Type: nucleic acid
Strandness: single
Topology: linear
Sequence

AGAAAGATGG CTTTAAAAGT GCTACTAG
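
The 28-base sequence of SEQ ID NO: 10 can be located within the cDNA of SEQ ID NO: 4 by plain substring search. The sketch below is illustrative only: it assumes the stray "l" in the scanned SEQ ID NO: 10 row is an extraction artifact, it uses only the first 70 bases of SEQ ID NO: 4, and `find_sites` is a hypothetical helper name.

```python
def find_sites(template: str, oligo: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) positions of every exact match."""
    template = "".join(template.split()).upper()
    oligo = "".join(oligo.split()).upper()
    sites = []
    start = template.find(oligo)
    while start != -1:
        sites.append((start + 1, start + len(oligo)))
        start = template.find(oligo, start + 1)
    return sites


# First 70 bases of SEQ ID NO: 4 (5' untranslated region and start of the CDS).
seq4_head = (
    "GGGAACGTAG AACTCTCCAA CAATAAATAC ATTTGATAAG "
    "AAAGATGGCT TTAAAAGTGC TACTAGAACA"
)
seq10 = "AGAAAGATGG CTTTAAAAGT GCTACTAG"

print(find_sites(seq4_head, seq10))  # [(39, 66)]
```

Under these assumptions the oligonucleotide matches positions 39 to 66 of SEQ ID NO: 4, a window that includes the ATG at position 45 where the annotated CDS begins.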